Event Identification and Tracking in Social Media Streaming Data
نویسندگان
چکیده
In recent years, the growing popularity and active use of social media services on the web have resulted in massive amounts of user-generated data. With these data available, there is also an increasing interest in analyzing it and to extract information from it. Since social media analysis is concerned with investigating current events around the world, there is a strong emphasis on identifying these evens as quickly as possible, ideally in real-time. In order to scale with the rapidly increasing volume of social media data, we propose to explore very simple event identification mechanisms, rather than applying the more complex approaches that have been proposed in the literature. In this paper, we present a first investigation along this motivation. We discuss a simple sliding window model, which uses shifts in the inverse document frequency (IDF) to capture trending terms as well as to track the evolution and the context around events. Further, we present an initial experimental evaluation of the results that we obtained by analyzing real-world data streams from Twitter.
منابع مشابه
An Online Visual Search Engine for Mining Streaming Text Data in Real-Time
The ever-increasing scale of streaming texts presents a fundamental challenge to analyzing, visualizing and discovering useful information among the endless rivers of text available in social media. In this paper, we present an online visual search engine that efficiently handles querying and retrieval of text streams of interest for understanding streaming tweet data. With regards to the user-...
متن کاملDiscovering Credible Events in Near Real Time from Social Media Streams
Title of dissertation: DISCOVERING CREDIBLE EVENTS IN NEAR REAL TIME FROM SOCIAL MEDIA STREAMS Cody Buntain, Doctor of Philosophy, 2016 Dissertation directed by: Professor Jennifer Golbeck School of Information Recent reliance on social media platforms as major sources of news and information, both for journalists and the larger population and especially during times of crisis, motivate the nee...
متن کاملTwitter event detection: combining wavelet analysis and topic inference summarization
Today streaming text mining plays an important role within real-time social media mining. Given the amount and cadence of the data generated by those platforms, classical text mining techniques are not suitable to deal with such new mining challenges. Event detection is no exception, available algorithms rely on text mining techniques applied to pre-known datasets processed with no restrictions...
متن کاملReal-time Detection of Content Polluters in Partially Observable Twitter Networks
Content polluters, or bots that hijack a conversation for political or advertising purposes are a known problem for event prediction, election forecasting and when distinguishing real news from fake news in social media data. Identifying this type of bot is particularly challenging, with state-of-the-art methods utilising large volumes of network data as features for machine learning models. Su...
متن کاملTowards Social Event Detection and Contextualisation for Journalists
Social media platforms have become an important source of information in course of a breaking news event, such as natural calamity, political uproar, etc. News organisations and journalists are increasingly realising the value of information being propagated via social media. However, the sheer volume of the data produced on social media is overwhelming and manual inspection of this streaming d...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014